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ABSTRACT 


DATA  ENVELOPMENT  ANALYSIS  AND  AXIOMATIC  NOTIONS 
OF  EFFICIENCY  AND  REFERENCE  SETS 
By  A.  Charnes,  W.W.  Cooper,  J.  Rousseau  and  J.  Semple 
The  University  of  Texas  at  Austin 


Serious  mathematical  and  computational  errors  and  misstatements 
culminating  In  erroneous  characterizations  of  Data  Envelopment  Analysis  (DEA) 

Y  J?" 

models  and  methods  and  their  relationship  to  the  axiomatic  production  models 
of  Shepard  type  by  Fare,  Hunsaker  and  others  are  corrected  together  with  new 
exposition  and  contrast  of  current  DEA  methodology  with  the  Shepard  axiomatic 
modelling  types.  The  stochastic  base  of  DEA  Is  shown  to  be  uncertainty,  not 
risk.  A  new  computationally  effective  extended  additive"  CEA)  model  Is 
developed  to  handle  processes  with  Input  thresholds  and  output  ceilings  and 
thereby  not  subsumable  under  Shepard  axlomatics. 


KEY  WORDS 

Data  Envelopment  Analysis 
CCR  Ratio  Model 
Multi-objective  Programming 
Efficiency  Analysis 
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DATA  ENVELOPMENT  ANALYSIS  AND  AXIOMATIC  NOTIONS 
OF  EFFICIENCY  AND  REFERENCE  SETS 


By  A.  Charnes,  W.W.  Cooper,  J.  Rousseau  and  J.  Semple 

Introduction 

The  Management  Science  paper  (Vol.  32,  2,  February  1986)  of  Fare  and  Hunsaker, 
"Notions  of  Efficiency  and  Their  Reference  Sets,"  contains  serious  mathematical  aro 
computational  errors  and  misstatements  culminating  in  erroneous  characterizations  of  Data 
Envelopment  Analysis  (DEA)  models  and  methods  and  their  relationship  to  the  "axiomatic" 
production  models  of  Shepard  type.  In  particular,  every  (sic)  DEA  example  solution  presented  is 
erroneous,  plus  the  stated  ratio  model  itself  is  erroneous  despite  their  bibliographic  citations  of 
papers  [I],  [2]  containing  the  correct  non- Archimedean  model.  Again,  the  paper  presents 
axioms  stated  in  obsolete  mathematical  terminology  plus  incomplete  specification  so  that 
equivalence  claims  for  models  and  more  appear  to  be  technically  unsupported  . 

In  the  following,  we  correct  such  errors,  exposit  and  contrast  the  DEA  methodology 
with  that  which  rests  on  the  complex  Shepard  axiomatic  production  theory.  The  cited  examples  of 
the  latter  structures,  re  efficiency  considerations,  are  subsumed  in  those  CCR  ratio  mocels 
which  have  ''reference,"  or  "production  possibility,"  sets  of  conical  type.  These  conical  sets,  as 
recognized  by  Shepard  and  Farrell  [26],  do  not  accurately  describe  production  possibilities  for 
many  processes.  We  therefore  bring  to  the  fore  some  of  these  real  phenomena  and  data  aspects 
and  their  Implications  for  desiderata  in  reference  sets,  modeling,  computation,  informational 
presentation  and  management  evaluation.  In  particular  we  present  a  new  computationally 
effective  extension  of  the  (non-ratio  C2GS2)  DEA  model  of  [3],  which  we  call  the  "extended 
additive"  (EA)  model,  which  can  properly  take  account  of  processes  with  input  thresholds  arc 
output  ceilings  and  thereby  a  whole  range  of  discretionary  to  non-discretionary  variable 
configurations  not  subsumable  under  the  Shepard  axiomatics. 


- 
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Non-Archlmedean  Errors 

As  presented  orally  and  In  print  (xerox)  at  a  session  of  the  Rutgers  University 
Productivity  Conference  in  1981  of  Dr.  A.  Dogramaci  ana  attended  by  R.  Fare  ana  as  presertec  :r 
the  Fare  -  Hunsaker  (F-H)  bibliography  [l]  [2],  the  non- Archimedean  CCR  ratio  mocel  is  ir 
simpler  notation) 

max  h  =  u^yo  /  vTx0 

(  1  )  uTyj  /  vTXj  <  I ,  j  =  1  . .  ,  n 

-uT  /  vTx0  <  -e  eT 
-vT  /  vTx0  <  -e  eT 

where  (x j ,  yj)  is  the  observed  input  -  output  vector  of  DMUj,  0  ("zero")  designates  the  Dml 
whose  efficiency  is  being  estimated,  eT  is  a  vector  of  "ones”  and  e  is  the  (positive-  <Ncr- 
Archimedean  infinitesimal. 

By  the  Charnes  -  Cooper  transformation  jjl  =  tu.  =  tv,  tvTx0  =  l ,  it  transforms  :rt : 
the  linear  programming  problem  (R)  with  dual  (DEA),  the  Data  Envelopment  side, 

(R)  (DEA) 

max  h  =  pTy0  min  6  -ee's+-eeTs' 

vTx0  =  I 

(1.1)  pTY  -  -0TX  <  0  YX  -  s+  =  uc 

-gT  <  -ee^  9x0  -xx  -s'  *  0 

-  ^  <  -ee 

X,  s+,  s'  >  0 


x  A  [X ,  .,  xn] 


Where  Y  A  [y  | ,  .  .  .,  yn] 


Evidently  the  optimal  value  will  be  of  "complex  number"  form  a  -  be  ,  with  a  =  9*  representing 
the  "scale"  efficiency  ana  b  representing  eTs*+  +  e  s*-,  the  sum  of  optimal  slacks  (input 
surpluses  ana  output  shortfalls). 

instead,  F  -  H  erroneously  rencer  ( l )  as 

ERRONEOUS  MODEL 
max  h  =  u^y0  /  vTx0 

(  2  )  u^yj  /  v^x,  <  I  ,  j  =  1 .  .  ..  n 

T  T 

<  -ee1 

_VT  <  _eeT 

Using  subscript  2  in  place  of  the  confusing  subscript  0  of  F-H,  their  example 
problems  may  be  written  as 

(21)  max  u/(2"v  |  +  5"v2^ 

u/(2  v ]  +  5  V2'  <  I 

u/(2  v  t  *  6  V2>  <  1 

u,  v  | ,  V2,  >  e 

arc 

'22)  max  u /  ( 2  v  |  ♦  6  v 2 ) 

u/(2  v |  ♦  5  V2)  <  I 
u/(2  Vj  +  6  V2)  <  I 
u,  v  | ,  V2  >  e 

where  the  Quotation  marks  denote  multiplication  by  a  (the  same)  power  of  1 0 

Setting  W|  *  u/(2  vj  +  5  V2),  W2  =  u/(2  v  |  ♦  6  V2),  sirce  the  u  arc  ■, ,  are  r  t- 
regative,  we  have  always  w  |  >  W2,  thus  the  second  constraint  is  recurcart  For  cptima'if^  - 
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Thereby  w 2 *  =  (2  V]  *  +  5  V2*)  /  (2  v|  *  ♦  6  V2*5  =1-1  V2*/(2  V]  *  +  6  V2*5. 

Evidently  DMU|  is  efficient  (w  |  *  =  1 )  in  all  of  these  examples  and 
(2.1.1)  V|*  =  I,  v2*  =  1,  u*  =  2"  *  5" 

is  one  of  the  infinite  number  of  optimal  solutions,  which  further,  need  not  involve  the  rcr- 
Archimedean  infinitesimal  ! 

Because  of  W2  =  I  -  l”v2/(2"v|  +  6"v2)  for  optimality  in  (2.2),  there  is  n£ 
maximum  but  only  a  supremum  value  of  one  (“efficiency").  For,  taking  v  |  =  pv2  and  V2  =  l ,  we 

have 

(2.2.  1  )  h  =  W2  =  I  -  l/(2p  +  6)  ->  1  as  p  -*  » 

Even  further,  as  may  be  suspected  from  the  above,  (2),  which  are  F  -  H's  ecuaticrs 
(2.2)  on  page  238,  may  be  reduced  to  an  eouivalent  form  without  the  non-Archimecear 
infinitesimal  e  !  Setting  u  =  eu,  v  =  ev,  (25  becomes 

max  uTy0/v '  x0 

uTy )/vTXj  <  I ,  j  =  l ,  .,  n 

(3)  uT  >  eT 

vT  >  eT 

The  F  -  H  "solution",  h  =1/(1  *  e),  V2  =  e,  V|  =  (l  -  5e)/2,  u  =  1  cr  p  239, 

ecuation  (2.4)  and  its  accompanying  statement  "Since  the  solution  to  problem  (2.35  is  rot  2  real 
rumber,  the  value  of  h  cannot  be  computed"  are  erroneous,  witness  (2.2.  l )  arc  (3) 

The  F  -  H  "solutions"  of  their  example  2,  h  =  1/8,  u  =  l ,  v j  =  V2  _  &  arc 

h  =  1  /7,  u  =  1 ,  v |  =  v 2  =  1 0~6  (for,  respectively,  DMU2  and  DMU | )  and  their  accompanying 
statement  "Thus  no  one  DMU  is  efficient"  are  erroneous  since  by  (221)  sup(h)  =  1  for  DMU 2 
and  by  (2  1  l ),  h  =  l  for  DMU  | 
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Using  instead  on  example  2  the  correct  CCR  ratio  mode)  ( 1 )  and  the  PEA  side  of  the 
linear  programming  eguivalent  (i  n,  from  which  all  Data  Envelopment  Analysis  computations 
have  been  performed  heretofore,  we  obtain  by  trivial  observation  the  optimal  basic  solutions 

(131)  9*^1,Ai*=1JS2*“  =  Owith  dual  optimum  y*  =  1,  U]*  =  (1  -  5  e)/2 

U2*  =  e  with  h*  =  u*  =  1  for  DMU ; . 
and 

(1.32)  e*=11A|*=11S2*“=l"with  dual  optimum  y*  =  1  -  l"  e, 

v i  *  =  ( 1  -  1  e )/2  ,  "02*  =  e  with  h*  =  y*  =  1  -  1  "e  for  DMU2- 

_hus,  correctly,  DMU]  is  rated  efficient  and  DMU2  is  not. 

For  the  last  few  years,  a  real  efficiency  value,  if  desired,  has  been  oeterminec  from  the 
optimal  real  9,A,s\s"  as 

( 1.3.3)  9*[  1  -  (e7s"+  +  eTs*“)  /  (eTx0  ♦  eTy0  +  e's**  +  e~s**“)] 

in  example  2,  DMU]  thus  has  value  l  and  DMU2  has  value  approximately  0.9. 

Most  vitally,  the  CCR  ratio  model  always  has,  via  the  dual  theorem  of  !irear 
programming  applied  to  ( l .  1 ),  a  basic  optimal  solution  pair,  hence  a  true  minimum  for  DEA  arc 
a  true  maximum  for  (R).  Evidently,  from  their  own  example,  the  erroneous  F  -  H  mccel  oces 
rot. 
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Computation  and  Informatics 

Computation  for  (1.1)  has  been  aval  1  able  from  the  first  (1951)  lr  Charnes’  rcr- 
Archimeaean  simplex  method  ana  solution  of  the  "cegeneracy"  problem  [4],  ir  classes  for  the  past 
35  years  and  in  the  1961  Channes  -  Cooper  text,  Management  Models  ana  Industrial 
Applications  of  Linear  Programming"  [5],  by  the  use  of  additional  columns  anc  rows  ccntainrc 
the  (base  field)  coefficients  of  the  non-Archimedean  guantities.  The  NONARC  code  of  Dr.  Icbal  Ali 
(of  The  University  of  Texas  at  Austin)  will  also  solve  such  non-Archimedean  linear  programming 
problems  by  a  2-phase  process  similar  to  the  Phase  I,  Phase  II  process  of  usual  simplex  method 
codes,  which  actually  solves  a  similar  non-Archimedean  problem.  Additionally,  ordinary 
simplex  codes  can  be  used  for  e  sufficiently  small  numerically  relative  to  the  other  matrix  data 
reciprocals.  The  infinitesimal  must  however  be  taken  larger  than  the  numerical  zero  tolerance. 
Routinely  also,  before  computation,  linear  programming  codes  scale  the  matrix  of  coefficients  by 
change  of  units  or  multiplication  across  rows  so  that  the  entries  lie  in  the  0-100  range. 

Toward  both  purposes  as  mentioned  in  [2],  cited  in  the  F  -  H  bibliography,  the 
cceff icieht  of  9  in  the  functional  of  (1.1)  has  been  enlarged  to  1 00  (correspording  to  percert 

scale  efficiency)  or  more.  And  one  can  even  multiply  the  functional  by  e~ 1  leaving  9  with  the 
only  non-Archimedean  coefficient.  As  shown  by  the  exact  solutions  for  u* ,  D]*,  X2*  in  (1  3.1 : 

ana  ( 1 .3.2),  numerical  difficulties  are  certain  for  computation  in  example  2  with  cuctaticr 
marks  corresponding  to  10^  and  e  =  10'&.  Enlargement  of  the  coefficient  of  5  ir  the 
functional  and/or  scaling  as  stated  in  [2]  has  sufficed  to  remove  such  difficulties  ir  all  past 
applications  handled  by  professionals  in  mathematical  programming  computation. 

^he  major  problem  in  DEA  analysis  computations  has  ret  beer  arc  is  ret  such 
numerical  niceties,  but  rather  in  solving  the  informatics  problem  of  providing  the  scluticr  tc 
DEA  problems  for  all  the  DMUs  with  preservation  and  extraction  of  the  detailec  irfcrmaticr 
neeced  for  analysis.  For  example,  one  recuires  "window  analyses"  based  on  efficiency  values  [6], 
arc  "envelopment  maps"  for  assessing  stability,  possible  data  errors  and  robustness  of  the 
empirical  ^areto-Koopmans  optimal  production  function  together  with  time  series  behavior  [7] 


This  is  in  addition  to  providing  solution  detail  (on  the  PEA  side)  for  every  DMU  Further, 
contrary  to  the  impression  given  by  the  F-H  paper,  there  exists  not  one  but  a  multiplicity  cf  DEA 
models,  the  software  for  which,  thus  far,  has  been  elided  so  far  as  possible  so  that  with  a  smcle 
software  package  one  can  call  up  a  DEA  analysis  for  any  one  of  these  types. 

Development  of  such  efficient  packages  has  been  possible  because  at  most  only  the  right 
hand  side,  a  column  and  the  functional  need  be  changed  in  passing  from  one  DMU  to  another  Also, 
the  invariant  multiplicative  DEA  model  [8]  has  the  same  formal  structure  as  the  "accitive" 
C^GS2  model  [3],  Because  the  analyses  required  from  most  models  cemand  extersive 
computation,  it  is  vital  when  developing  new  models  for  other  production  possibility  sets  to  tie  ir 
their  specifications  as  closely  as  possible  to  the  computational  capabilities  of  existing  software 
mccules  and  the  additional  data  which  can  be  accessec  easily  through  simple  extersicrs. 
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Shepard  Axiomatlcs 

Shepard  formulated  a  theory  of  production  [  1 2]  for  a  restrictea  class  of  procucticr 
technologies  which  culminated  in  an  elegant  transform  theory  between  production  arc  cost  aspects 
cal  lee  Shepard-Samuelson  Duality.  More  recently  others  such  as  his  stuoent  R.  Fare  arc  the 
economist  R.  Russell,  have  focused  on  extending  this  theory  to  encompass  efficiency 
considerations  which  thereby  Involve  certain  abstractly  defined  frontier  sets  related  to  the 
Shepardian  characterization  of  efficiency.  Their  work  involved  similarly  the  complex 
mathematical  constructs  of  point-to-set  maps  ("correspondences,"  in  obsolete  termirolcgy)  of 
Shepard  so  that  to  exercise  this  theory  on  particular  technologies  it  is  necessary  to  identify  them 
as  satisfying  Shepard-type  axioms. 

~hese  axioms  typically  refer  to  a  point-to-set  map  L  transforming  an  output  vector  u 
irto  the  set  L(u)  of  all  input  vectors  x  which  can  produce  the  output  vector  u.  a  key  axiom  is 

L 4  L(o)  is  a  closed  correspondence 

~h i s  axiom  is  not  verified  in  the  F-H  paper,  inceed,  "closed  correspondence"  is  ret  ever  cef  ;rec 
there  or  in  other  papers  in  the  field.  ~he  Fare,  Grosskopf,  Lovell  moregraph  [9]  (p  203)  gives 
an  incomplete  (and  typographically  erroneous)  definition.  Russell,  perhaps  ccrfusee  bu  the 
archaic  terminology  and  absent  topolegies  he  attempts  to  correct,  rercers  this  axiom  irccrrectly 
as 

L4‘  L(u)  is  a  closed  set  (of  input  points1). 

’o  define  a  closed  map  reauires  the  specification  of  three  topologies  —  ore  for  tKo 
cemain,  one  for  the  range,  arc  one  for  the  product  topology  of  the  two.  A  map  may  be  closed  or  rot 
deperairg  on  which  topologies  are  prescribed  for  these  three  topological  spaces  a  most  clear 
exposition  and  examples  are  given  on  p  3 1 2-3 1 3  of  the  Narici  and  Becker  stem  text  [  i 0]  ~re 
correct  definition  follows. 

Let  S  and  ~  denote  topological  spaces  and  5  X  ~  their  tepo logical  prccuct  space  Let f  ce  a 
map  from  5  into  ~  The  graph  of  f,  G(f),  is  the  set  of  pairs  Us,  f(s)).  s  e  5} 

Defiriticr  S  is  a  "closed  map"  iff  G(f 3  is  a  closed  subset  of  S  X  ~  in  the  product  topology 


We  note  that  the  definition  of  graph  on  p.  25  of  Fare,  Grosskopf,  arc  Lovell,  ecuafcr 
(2.1.3),  is  thereby  incorrect  since  it  involves  pairs  of  input  output  vectors  rather  thar  pairs 
(u,  L(u))  of  output  vector  u  ana  image  L(u),  the  latter  a  point  in  the  space  of  all  sets  of  irpui 
vectors  x.  No  topology  is  specified  for  the  latter  as  points  nor  for  the  topological  prccuct  ~hus 
the  Fare,  et  al,  identifications  are  invalid  since  the  axioms  to  be  satisfied  are  macecuately  or 
erroneously  defined. 
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Data  Envelopment  Analysis 

Contrary  to  the  impression  given  in  the  Fare  ana  Hunsaker  paper,  Data  Envelcprrert 
Analysis  is  a  methodology  which  in  relation  to  notions  of  efficiency  ana  reference  sets  has 
developed  important  sets  of  models  and  new  relations  between  classical  efficiency  notions, 
economic  production  theory,  as  well  as  new  auditing  and  operational  aspects  of  management  cf 
productive  processes  undertaken  by  a  multiplicity  of  organizational  or  response  units.  ~he  first 
DEA  model,  the  non-Archimedean  CCR  ratio  model,  established  a  connection  between  the  classic 
scientific-engineering  notion  of  efficiency  and  the  Farrell  economic  production  theory  notion  by 
generalizing  the  scientific-engineering  efficiency  definition  to  multiple  input,  multiple  output 
systems  and  to  relative  efficiency.  These  were  connected  by  mathematical  programming  duality 
as  in  the  (R)  and  (DEA)  of  equations  (1.1). 

The  combination  of  multiple  inputs  (multiple  outputs)  into  a  single  virtual  input 
(virtual  output)  is  possible  in  an  infinite  number  of  other  ways  than  in  (l.l),  eg,  a 
multiplicative  one  by  raising  inputs  (outputs)  to  nonnegative  powers  and  multiplying  them 
together  to  obtain  a  single  virtual  input  (virtual  output).  The  resulting  ratio  model,  on  taking 
logarithms  of  input  and  output  entries,  reduces  to  a  linear  programming  model  with,  crce  acair, 
the  dual  DEA  side  corresponding  to  a  measure  of  relative  efficiency  for  an  explicitly 
(analytically)  stated  production  possibility  set  [11],  The  form  of  this  which  gives  an  efficiency 
measure  independent  of  the  units  in  inputs  and  outputs  yields  dually  always  a  piece-wise  Ccbb- 
Douglas  economic  production  function  [8], 

The  key  paper  of  Charnes,  Cooper,  Golany,  Seiford  and  Stutz  [3]  established  the  fact 
that  the  extremal  models  of  Data  Envelopment  Analysis,  on  the  DEA  side,  were  the  Charres- 
Cooper  test  for  Pareto-optlmality  of  a  test  point  (input-output  vector)  [131,  [14],  "hus,  the 
essence  of  the  DEA  methodology  is  in  providing  a  test  for  multi-objective  optimality  of  a  specif lec 
point  (here  a  possible  DMU  input-output  vector)  chosen  from  a  specified  reference  set  (here 
production  possibility  set)  which  yields  simultaneously  quantitative  values  of  input  surpluses 
and  output  shortfalls  needed  to  obtain  efficiency. 
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Recognizing  this  connection,  Charnes,  Cooper,  and  Thrall  [15]  aevelopec  ar 
Archimedean  characterization  of  efficiency  types  for  the  CCR  ratio  model  in  which  scale 
efficiency  and  technical  efficiency  are  chacterized  by  the  dimensionality  of  the  set  of  optimal 
solution  vectors  to  the  correspondent  of  (R)  in  ( l ).  Via  semi-infinite  programming  this  has 
been  generalized  to  an  infinite  number  of  DMU's  by  Charnes,  Cooper,  ano  Wei  [16]  with 
corresponding  generalizations  of  other  models  forthcoming.  Further,  another  direction  of 
generalization  of  the  ratio  models  has  been  to  arbitrary  closed  convex  cones  for  the  virtual 
multipliers  and  for  the  production  possibility  sets  and  data  envelopments  by  Charnes,  Cooper, 
Wei  and  Huang  [17]  with  extensions  to  other  models  forthcoming  as  well  The  dimensionality 
construct  in  Charnes,  Cooper  and  Thrall  [15]  has  been  simplified  (and  generalized  to 
inferiority  properties  relative  to  the  cone  of  virtual  multipliers  [  1 6],  Thus,  this  extension  anc 
the  semi-infinite  extension  represent  generalizations  to  possibly  nonpolyhedral  reference  anc 
virtual  multiplier  sets  which  involve  thereby  nonlinear  programming  and  duality  relations 

Before  proceeding  to  a  new  computational ly-effective  DEA  model,  we  need  to  correct  a 
misimpression  by  the  noted  economist,  P,  Schmidt  [18],  to  the  effect  that  Data  Envelopment 
Analysis  is  a  deterministic  and  not  a  statistical  methodology.  The  fact  is  that  all  the  DEA 
methodology  from  Farrell  onward  involve  determinations  of  relative  efficiency  ano  of  efficient 
(frontier)  production  functions  based  on  sample  observations.  By  the  mathematical  cef iriticr  of 
the  word  "statistic"  the  quantities  calculated  are  all  statistics.  What  is  true  is  that  so  far  we  nave 
only  a  bare  beginning  of  statistical  theory  for  these  statistics  [3],  A  particular  problem  is  new 
one  should  characterize  "waste"  (input  surpluses)  and  "possible  achievement  failures"  (output 
shortfalls)  stochastically.  This  is  not  addressed  adeouately  by  parametric  formula  fittirc 
according  to  maximum  likelihood  or  other  classical  statistical  estimation  schemes  which. 


Incidentally,  have  proven  substantially  inferior  to  DEA  methocs  in  ccmectly  estimatmc  vrcwr' 


frontten  formulae  from  random  samples  [  1 9],  Failing  such  a  theory,  devices  such  as  wmccw 
analysis  [6]  and  envelopment  maps  [7]  have  been  cevelopeo  to  exp  anc  sample  sizes  to  achieve 
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The  Extended  Additive  (EA)  Model 

Meaningful  determination  of  relative  efficiency  should  involve  comparisons  of  a  DMU  to 
actual  production  possibilities  only.  In  some  applications,  the  model  employed  has  efficiency 
determination  and  measurement  of  input  surpluses  and  output  shortfalls  referred  to  efficient 
points  which  are  not  possible.  Such  relative  efficiency  (and  effectiveness)  determinations  are 
thereby  invalid.  Many  of  these  applications  have  nevertheless  proved  worthwhile  because  the 
most  important  determinations  made  have  been  (correctly)  with  reference  to  efficient  inputs  and 
outputs  that  were  possible.  Examples  of  this  kind  have  occurred  when  some  input  variables  are 
nondiscretionary,  i.e.,  are  not  controllable  by  a  DMU  manager,  such  as  temperature  or  weather 
or  unemployment  rate.  The  modification  suggested  by  Banker  and  Morey  [20]  involving  a  change 
in  the  functional  is  not  appropriate  since  Pareto-optimal  (efficient)  DMU's  may  fail  to  be 
recognized  because  the  modified  functional  is  Incorrect  for  the  Charnes-Cooper  test  plus  the 
partly  conical  character  of  the  reference  set  (production  possibility  set)  may  lead  to  an 
impossible  efficient  referent.  Again,  many  production  processes  involve  input  thresholds  below 
which  the  process  does  not  work  and  output  ceilings  above  which  output  is  impossible  (as,  for 
example,  in  sales  in  a  market,  the  number  of  purchasing  households  in  the  area  cannot  exceed  the 
number  of  households  in  that  area.). 

In  the  key  paper  [3]  "Foundations  of  Data  Envelopment  Analysis  for  Pareto  Optimal 
Empirical  Production  Functions,"  a  model  which  today  we  call  the  "additive"  model  was  put 
forward  which  remedied  the  production  possibility  set  difficulty  of  cones  and  provided  a  basis  for 
other  more  valid  reference  sets.  We  now  extend  this  model  to  one  which  also  takes  care  of 
thresholds  and  ceilings  as  well  as  permitting  a  variable  to  range  from  discretionary  to 
nondiscretionary  in  a  way  which  is  trivially  implementable  on  existing  DEA  software. 


The  Chames-Cooper  test  for  Pareto  Optimality  (here  Pareto-Koopmans  Optimality)  of 
DMU0,  using  as  reference  set  the  convex  hull  of  the  input-output  vectors  of  DMU  |  to  DMUn,  may 

be  written  (see  [3])  as 

min  -  eTs*  -  eTs" 

yx  -s*  =  y0 

( A.  1  )  -XX  -S'  = -xQ 

eTX  =  1 

X,  s*,  s*  i  0 

If  one  wishes  this  value  to  be  independent  of  the  units  of  measurement  we  alter  the  functional  to 
(A. 2)  -eTD'UyQls*  -  eTD-,(x0)s'  =  -dw(s+,  s') 

where  D(y0),  D(x0)  are  diagonal  matrices  with  the  y0  or  x0  component  entries.  (If  some 
components  of  y0  or  x0  are  zero  we  use  the  unigue  Moore  -  Penrose  generalized  inverse  which  has 

zeros  instead  of  reciprocals  for  the  zero  components.)  For  either  functional  we  obtain  the  same 
DMUs  as  efficient  since 

(A. 2.1)  a  (eTs*  ♦  eTs") <  eTD~,(y0)s<'  +  eTD"'(x0)s'  <  p  (eTs+  +  eTs") 

where  a,  p  are  respectively  the  minimum  and  maximum  of  the  non-zero  entries  of  both  D~ 1  (y0) 
and  D'Uxq)  and  therefore  a,  p  >  0  .  It  is  useful  to  have  the  (A.  1 )  functional  form  both  for  theory 

and  practice  since  the  efficiency  hyperplane  it  defines  (eTs*  +  eTs'  =  0)  does  not  depend  on  any 
particular  x0,  y0. 

When  DMU0  is  Inefficient,  its  precise  efficiency  measure  is  not  the  important 

consideration,  but  rather  the  input  surpluses  and  output  shortfalls  needed  for  efficiency  (i.e.,  to 
be  on  the  economic  (Pareto-Koopmans  Optimal)  production  function  surface)  together  with  the 
"facet"  of  efficient  referent  DMUs  determining  the  local  production  function.  These  are  the 
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significant  managenially  meaningful  quantities.  Using  the  (A.2)  functional,  a  computationally 
easy  efficiency  scone  using  the  weighted  1 ,  -metric  of  (A.2)  is 

(A.2.2)  exp{-dw(s’,  s')/[dw(y0,  x0)  +  dw(s’,  s~)]}  ~  l-dw(s’,  s')/[s+m  ♦  aw(s’,  s')] 


(The  latter  may  be  interpreted  as  a  ratio  of  distance  measures  as  in  [2 1  ]).  Note  that  this  units 
invariant  score  may  be  applied  equally  well  to  the  multiplier  of  0*  in  ( 1 .3.3). 

We  now  present  the  Extended  Additive  (EA)  model  for  efficiency  rating  of  DMU0  as. 


V 

I1 


min  -  eTs’ -  eTs‘  ,  or  -eTD"'(y0)s’  -eTD"’(x0)s' 

YX  -s’  =  y0 

(EA)  -XX  -S'  =  -x0 

eTX  =  I 

$,'  s  PjX0|,  i  =  l, ....  m 

sr  *  ^Vyon  r  =  I ,  ■  ,  S 

X,  s’,  s'  >  0 

In  this,  by  adjusting  the  value  of  Pj  from  zero  to  one,  we  go  from  a  nondiscretionary  input  to  a 

completely  discretionary  input.  Practically,  since  the  numerical  value  of  a  nonciscreticraru 
input  is  not  that  precise,  one  may  use  always  a  small  value  of  Pj  dependent  on  the  character  of  the 

input  for  DMU0.  Raising  the  value  of  Pj  corresponds  to  not  allowing  the  referent  input  value  tc  cc 
below  a  specified  level  less  than  the  ith  input  value  of  DMU0.  I  e ,  the  referent  input  must  exceec 


this  threshold  input  level. 

Similarly,  by  adjusting  the  value  of  rr  from  zero  to  one  we  go  from  the  fixed  cutout  r 
level  of  DMU0  to  anything  permitted  by  the  other  constraints.  Raising  the  value  of  j> 

corresponds  to  not  allowing  the  referent  output  value  to  exceed  a  specified  level  above  the  rth 
output  level  of  DMU0  l.e.,  the  referent  output  cannot  exceed  this  ceiling  output  level  'he  use  cf 


nsrs 


l.'VW 


v.v//.  /.v 


such  individual  upper  bcunds  cn  the  slacks  poses  no  computational  problem  in  Data  Ervelcpmert 
Analysis  since  every  linear  programming  code  of  any  merit  today  incorporates  the  lrcivicual 
upper  bound  algorithmically  without  reouiring  its  explicit  specification  as  a  ccrstrairt. 

^hus  the  EA  model  permits  the  automatic  incorporation  of  real  local  restrictions  or 
many  processes  which  are  not  permitted  by  the  other  existent  mocels.  As  such,  the  associatec 
^aretc-Koopmans  efficiency  surface  is  a  more  valid  representative  of  the  true  nature  of  the 


processes. 
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Conclusions 

"o  summarize,  contrary  to  the  Fare,  Hunsaker,  et  a  1 ,  perjcratives,  Data  Envelopment 
Analysis  is  a  statistical  methodology  whose  models  consist  of  the  Charres-Ccoper  test  for  a 
^areto-Koopmans  mirimum  applied  to  test  point  observations  from  a  specifiec  reference  set  'he 
model  may  be  a  linear  programming  model,  or  a  transformably  convex  programming  mccel  in 
wmch  the  functional  may  or  may  not  be  linear,  and  in  which  the  reference  set  may  or  may  net  be 
polyhedral  and  may  (as  in  EA)  or  may  not  vary  with  the  test  point  (=DMU) 

"he  CCR,  Multiplicative,  BCC  [22],  Additive  and  EA  models  provice  different 
production  possibility  (reference)  sets  for  Data  Envelopment  Analysis,  its  connection  with 
multi-criteria  programming  is  through  the  Charnes-Cooper  test  for  multi-criteria  optimality, 
here  Dareto-Kocpmars  Optimality,  which  test  is  one  of  the  few  available  constructive  means  fen 
determining  such  optima  [23],  e.g.  efficient  DMUs. 

Data  Envelopment  Analysis  determines  "facets"  of  similar  efficient  DMUs  in  estimating 
the  unknown  economic  production  function  in  the  neighborhood  of  each  DM U  with  respect  to  the 
overall  referent  set.  "he  input  surpluses  and  output  shortfalls  of  each  DMU  that  DEA  determ  ires, 
rather  than  the  precise  numerical  score  assigned  to  inefficiency,  are  the  central  valuable  results 
Statistically,  or  stochastically,  DEA  deals  with  estimation  from  samples  in  situations  of 
uncertainty  (wherein  the  joint  distribution  of  the  random  variables,  input  surpluses  and  output 
shortfalls,  are  unknown)  rather  than  classical  situations  of  risk  (wherein  the  joint  distribution 
is  known)  "he  stochastic  models  of  Aigner,  Schmidt  and  others  have  dealt  only  with  risk  "he 
difference  this  makes  in  difficulty  and  lack  of  statistical  theory  may  be  appreciated  from  the 
work  of  Dvoretsky,  Kiefer  and  Wolfowitz  ( 1 952)  on  the  much  simpler  Inventory  Problem  [24], 
[25]  with  known  function.  Thus,  statistical  theory  development  is  an  important  challenge  for 
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